CDS

Accession Number TCMCG075C19637
gbkey CDS
Protein Id XP_017979247.1
Location complement(join(20059359..20059988,20060814..20061719))
Gene LOC18596233
GeneID 18596233
Organism Theobroma cacao

Protein

Length 511aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018123758.1
Definition PREDICTED: cytochrome P450 71A1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category Q
Description Belongs to the cytochrome P450 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction R07470        [VIEW IN KEGG]
R07474        [VIEW IN KEGG]
KEGG_rclass RC00661        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00199        [VIEW IN KEGG]
KEGG_ko ko:K20623        [VIEW IN KEGG]
EC -
KEGG_Pathway ko00905        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
ko01110        [VIEW IN KEGG]
map00905        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
map01110        [VIEW IN KEGG]
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005737        [VIEW IN EMBL-EBI]
GO:0005783        [VIEW IN EMBL-EBI]
GO:0005789        [VIEW IN EMBL-EBI]
GO:0012505        [VIEW IN EMBL-EBI]
GO:0016020        [VIEW IN EMBL-EBI]
GO:0031090        [VIEW IN EMBL-EBI]
GO:0031984        [VIEW IN EMBL-EBI]
GO:0042175        [VIEW IN EMBL-EBI]
GO:0043226        [VIEW IN EMBL-EBI]
GO:0043227        [VIEW IN EMBL-EBI]
GO:0043229        [VIEW IN EMBL-EBI]
GO:0043231        [VIEW IN EMBL-EBI]
GO:0044422        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044425        [VIEW IN EMBL-EBI]
GO:0044432        [VIEW IN EMBL-EBI]
GO:0044444        [VIEW IN EMBL-EBI]
GO:0044446        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0098827        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGAAACTCCGTCTTGGGCTTCTTACTTAGCAGCATGGCTTGCCACAATAGCCCTCGTCCTCCTCTCCCTCCGTCTCCGCCGTCGCCGTAAATTAAACTTGCCGCCGGGTCCAAAGCCCTGGCCCATAATCGGCAACCTCAACCTCATCGGCTCGCTTCCCCACCGATCCATCCATGCCCTTTCCCAAAAATACGGGCCCATCATGCAACTTCGGTTCGGGTCATTCCCCGTTATCGTGGGCTCTTCTGTTGAAATGGCCAAGGCCATTCTTAGAACTCATGATGTTGCCTTCGCTGGCCGGCCTAAAATTGCTGCAGGCAAATATACTACTTACAATTACTCTGACATTACATGGTCGCCGTACGGCCCGTATTGGCGTCAAGCACGTAAAATGTGCCTAACGGAACTTTTTAGTGCGAAACGCCTGGAGTCATACGAGTATATCCGAAGAGAAGAAATGAATTTGTTGCTAAAAGGCTTGTGCAACTTATCCGGTTCCCCGATTTGTTTGAAAGATCATCTTTCGAGTTTGAGTCTTAACGTAATCAGTAGGATGGTGTTGGGGAAAAAATACACGGAGGGGACTGGTGAAAATGAGATTGTCACCCCAAAGGAGTTCAAGGAGATGCTTGACGAGTTGTTCCTGCTTAACGGGGTGCTGGATATAGGCGACTCGATTCCCTGGCTCAGTTTCCTGGATTTGCAAGGTTATATTAAGAGAATGAAGGCCCTGAGCAAGAAGTTCGACAGATTCTTGGAGCACGTTTTGGATGAACATAATGCTAGGAGAAAAGGGGTCAAAGATTATGTTGCTAAGGATATGGTGGATGTGCTTTTGCAGCTTGCTGATGATCCCCATCTTGATGTCAAGCTTGAAAGGCATGGTGTTAAGGCATTTAGTCAGGATTTGATAGCTGGTGGAACCGAGAGTTCAGCAGTGACCGTAGAATGGGCAATTTCGGAGCTTTTGAAAAAGCCAGAAATTTTTGCAAAGGCCACGGAAGAACTAGACAGGGTAATCGGCAGAGATAGATGGGTAGAAGAAAACGACATTGCGAACCTACCCTACGTCAACTCAATTGCTAAAGAGACTATGCGTTTGCACCCTGTGGCACCCATGCTGGTGCCTCGCCTTGCTCGAGAAGACTGCCAACTAGCTGGTTATGACATTCCTAAGGACACTAGAGTTCTTGTAAACGTATGGACAATCGGGAGAGACCCTACTCTTTGGGACAACCCCGATGAATTTTGCCCCGACAGATTCATTGGGAAGGCTATCGATGTCAAAGGTCATGATTTTGAGCTGTTGCCGTTTGGGGCTGGAAGGAGGATGTGCCCTGGATATCCTCTCGGGATTAAGGTCATTCAAGCAAGTTTGGCTAATCTTTTACATGGGTTTACTTGGAAATTGCCTGGAAACATGGCAAAAGAAGATCTCAATATGGAGGAAATTTTCGGGCTTTCCACCCCTAAAAAATTCCCACTTGAGGCTGTGGCACAGCCTAGGCTCCCACTTCACATGTACTCTCAGTGA
Protein:  
METPSWASYLAAWLATIALVLLSLRLRRRRKLNLPPGPKPWPIIGNLNLIGSLPHRSIHALSQKYGPIMQLRFGSFPVIVGSSVEMAKAILRTHDVAFAGRPKIAAGKYTTYNYSDITWSPYGPYWRQARKMCLTELFSAKRLESYEYIRREEMNLLLKGLCNLSGSPICLKDHLSSLSLNVISRMVLGKKYTEGTGENEIVTPKEFKEMLDELFLLNGVLDIGDSIPWLSFLDLQGYIKRMKALSKKFDRFLEHVLDEHNARRKGVKDYVAKDMVDVLLQLADDPHLDVKLERHGVKAFSQDLIAGGTESSAVTVEWAISELLKKPEIFAKATEELDRVIGRDRWVEENDIANLPYVNSIAKETMRLHPVAPMLVPRLAREDCQLAGYDIPKDTRVLVNVWTIGRDPTLWDNPDEFCPDRFIGKAIDVKGHDFELLPFGAGRRMCPGYPLGIKVIQASLANLLHGFTWKLPGNMAKEDLNMEEIFGLSTPKKFPLEAVAQPRLPLHMYSQ